AITopics | Lugo

Lugo

UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation

Zhong, Siru, Hao, Xixuan, Yan, Yibo, Zhang, Ying, Song, Yangqiu, Liang, Yuxuan

arXiv.org Artificial Intelligence, Apr-22-2024

Urbanization challenges underscore the necessity for effective satellite image-text retrieval methods to swiftly access specific information enriched with geographic semantics for urban applications. However, existing methods often overlook significant domain gaps across diverse urban landscapes, primarily focusing on enhancing retrieval performance within single domains. To tackle this issue, we present UrbanCross, a new framework for cross-domain satellite image-text retrieval. UrbanCross leverages a high-quality, cross-domain dataset enriched with extensive geo-tags from three countries to highlight domain diversity. It employs the Large Multimodal Model (LMM) for textual refinement and the Segment Anything Model (SAM) for visual augmentation, achieving a fine-grained alignment of images, segments and texts, yielding a 10% improvement in retrieval performance. Additionally, UrbanCross incorporates an adaptive curriculum-based source sampler and a weighted adversarial cross-domain fine-tuning module, progressively enhancing adaptability across various domains. Extensive experiments confirm UrbanCross's superior efficiency in retrieval and adaptation to new urban environments, demonstrating an average performance increase of 15% over its version without domain adaptation mechanisms, effectively bridging the domain gap.
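The retrieval task described in the abstract can be illustrated with a minimal sketch, assuming text and image embeddings already exist and that the i-th text matches the i-th image. The helper names (`cosine`, `retrieve`, `recall_at_k`) and the toy vectors are hypothetical; this shows generic embedding-based retrieval and a Recall@K metric, not UrbanCross's model or its domain adaptation modules.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def retrieve(query, gallery, k):
    """Return indices of the k gallery vectors most similar to the query."""
    ranked = sorted(range(len(gallery)),
                    key=lambda i: cosine(query, gallery[i]),
                    reverse=True)
    return ranked[:k]

def recall_at_k(text_embs, image_embs, k):
    """Fraction of texts whose matching image (same index) is in the top k."""
    hits = sum(i in retrieve(t, image_embs, k)
               for i, t in enumerate(text_embs))
    return hits / len(text_embs)

# Toy embeddings (assumed for illustration only).
image_embs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
text_embs = [[0.9, 0.1], [0.2, 0.8], [1.0, 0.1]]

r1 = recall_at_k(text_embs, image_embs, k=1)
r2 = recall_at_k(text_embs, image_embs, k=2)
```

In this toy setup the third text is closer to the first image than to its own, so it is missed at k=1 and recovered at k=2. A domain gap of the kind the abstract describes would show up as a drop in such a metric when the gallery comes from a different region than the training data.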

adaptation, retrieval, urbancross, (10 more...)

arXiv:2404.14241

Country:

Europe > Finland (0.08)
Europe > Germany (0.07)
Europe > Spain > Galicia > Madrid (0.04)
(6 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Zero-shot Triplet Extraction by Template Infilling

Kim, Bosung, Iso, Hayate, Bhutani, Nikita, Hruschka, Estevam, Nakashole, Ndapa, Mitchell, Tom

arXiv.org Artificial Intelligence, Sep-20-2023

The task of triplet extraction aims to extract pairs of entities and their corresponding relations from unstructured text. Most existing methods train an extraction model on training data involving specific target relations, and are incapable of extracting new relations that were not observed at training time. Generalizing the model to unseen relations typically requires fine-tuning on synthetic training data which is often noisy and unreliable. We show that by reducing triplet extraction to a template infilling task over a pre-trained language model (LM), we can equip the extraction model with zero-shot learning capabilities and eliminate the need for additional training data. We propose a novel framework, ZETT (ZEro-shot Triplet extraction by Template infilling), that aligns the task objective to the pre-training objective of generative transformers to generalize to unseen relations. Experiments on FewRel and Wiki-ZSL datasets demonstrate that ZETT shows consistent and stable performance, outperforming previous state-of-the-art methods, even when using automatically generated templates. https://github.com/megagonlabs/zett/
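The idea of casting triplet extraction as template filling can be sketched as follows. Everything here is an assumption made for illustration: the relation templates, the candidate entity list, and `toy_score`, a bigram-overlap function standing in for a pre-trained language model's score. ZETT itself uses a generative transformer to infill templates; this sketch only shows the ranking step over filled templates.

```python
from itertools import permutations

# Hypothetical relation templates; the abstract does not list ZETT's templates.
TEMPLATES = {
    "place_of_birth": "{head} was born in {tail}.",
    "employer": "{head} works for {tail}.",
}

def tokens(text):
    """Lowercase, drop periods, split on whitespace."""
    return text.lower().replace(".", "").split()

def bigrams(toks):
    """Adjacent token pairs, in order."""
    return list(zip(toks, toks[1:]))

def toy_score(context, hypothesis):
    """Stand-in for an LM score: share of hypothesis bigrams seen in context."""
    ctx = set(bigrams(tokens(context)))
    hyp = bigrams(tokens(hypothesis))
    return sum(b in ctx for b in hyp) / len(hyp)

def extract_triplet(context, entities, templates=TEMPLATES, score=toy_score):
    """Fill every template with every ordered entity pair; keep the best."""
    best_score, best_triplet = -1.0, None
    for relation, template in templates.items():
        for head, tail in permutations(entities, 2):
            filled = template.format(head=head, tail=tail)
            s = score(context, filled)
            if s > best_score:
                best_score, best_triplet = s, (head, relation, tail)
    return best_triplet

context = "Marie Curie was born in Warsaw in 1867."
triplet = extract_triplet(context, ["Marie Curie", "Warsaw"])
```

Because a relation is defined only by its template, supporting a new relation means adding one template string rather than collecting training data, which is the zero-shot property the abstract refers to.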

computational linguistic, relation, template, (14 more...)

arXiv:2212.10708

Country:

Europe > Switzerland (0.05)
Europe > Hungary (0.05)
North America > Jamaica > St. James > Montego Bay (0.05)
(23 more...)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Sports (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
